NextAI Compute
Requesting Quota Increase
NextAI model deployment on cloud may encounter quota limitations when provisioning resources, especially for high-end GPUs. To request a quota increase, follow these steps:
AWS (Amazon Web Services)
AWS (Amazon Web Services)
- Go to the EC2 Quotas console.
- Select your desired region from the top right.
- Choose an EC2 instance type from the list.
- Click the quota name and select “Request quota increase.”
- Enter the new quota value you need.
- Click “Request.”
GCP (Google Cloud Platform)
GCP (Google Cloud Platform)
- Go to the Google Cloud Console Quota page.
- Filter by “Service: Compute Engine API.”
- Select the desired limit name (e.g., NVIDIA-L4-GPUS-per-project-region).
- Check the region you want to change the quota for.
- Click “Edit Quotas” and set the new limit.
- Submit your request.
Azure
Azure
- Visit Azure’s quota page.
- Select “Request Increase” at the top.
- Choose “Compute-VM (cores-vCPUs) subscription limit increases” for Quota type.
- Follow the steps to specify details, including regions and VM series, and enter the new vCPU limit.
- Confirm your contact details.
- Review and create your request.
After submitting the request, the support team will review it. To increase your chances of approval, be responsive to any inquiry emails regarding how you plan to use the requested resources for your NextAI projects.
These steps will help you request a quota increase for cloud resources, ensuring you have the necessary capacity to support your NextAI initiatives.